Expressive text-to-speech approaches
نویسندگان
چکیده
The core concern of this paper is the modelling and the tractability of expressiveness in natural voice synthesis. In the first part we quickly discuss the imponderable gap between natural and singing voice synthesis approaches. In the second part we outline a four level model and a corpus-based methodology in modelling expressive forms—an essential step towards expressive voice synthesis. We then try to contrast them with recurrent concerns in singing voice synthesis. We finally undertake a first reflection about a possible transposition of the approach to singing voice. We conclude with some program considerations in Research and Development for the singing voice synthesis, inspired from natural voice synthesis techniques.
منابع مشابه
Expressive speech synthesis: a review
The objective of the present work is to provide a detailed review of expressive speech synthesis (ESS). Among various approaches for ESS, the present paper focuses the development of ESS systems by explicit control. In this approach, the ESS is achieved by modifying the parameters of the neutral speech which is synthesized from the text. The present paper reviews the works addressing various is...
متن کاملStudy on Unit-Selection and Statistical Parametric Speech Synthesis Techniques
One of the interesting topics on multimedia domain is concerned with empowering computer in order to speech production. Speech synthesis is granting human abilities to the computer for speech production. Data-based approach and process-based approach are the two main approaches on speech synthesis. Each approach has its varied challenges. Unit-selection speech synthesis and statistical parametr...
متن کاملبرجسته سازی در خطبۀ فدکیه حضرت زهرا(ع)
Foregrounding is one of the contemporary literary theories, which from a literary perspective to texts, in prose or verse, endeavors to explain and analyze those effective features and elements in the body of the discourse which rhetorically distinguish literary texts from ordinary ones. According to the Formalists, foregrounding is achieved through diminishing or increasing the rules. In other...
متن کاملModeling the acoustic correlates of expressive elements in text genres for expressive text-to-speech synthesis
This paper proposes a novel approach for describing the expressive elements in text genres and modeling their acoustic correlates for expressive text-to-speech synthesis (TTS). We apply the three-dimensional PAD (pleasure-displeasure, arousal-nonarousal and dominance-submissiveness) model in describing expressivity. In particular, we define a set of principles for annotating the P and A values ...
متن کاملIndividual Variability in the Discrimination of Audiovisual Spontaneous vs. Acted Expressive Speech
Though a very large majority of studies focused on expressive speech have used acting as a convenient method for obtaining utterances with the same phonetic contents expressing various affects, the reliability of such material for the modeling of vocal expressions of spontaneous expressive speech has come into debate during the last decade (Campbell, 2000). Such reservations have incited a grow...
متن کاملeXTRA: A Culturally Enriched Malay Text to Speech System
This paper concerns the incorporation of naturalness into Malay Text-to-Speech (TTS) systems through the addition of a culturally-localized affective component. Previous studies on emotion theories were examined to draw up assumptions about emotions. These studies also include the findings from observations by anthropologists and researchers on culturalspecific emotions, particularly, the Malay...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007